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(1) GENERAL INFORMATION: 

(i\ APPLICANT: Reed, John 
(ii) \lTLE OF INVENTION: Regulation of bcl-2 Gene Expression 
(iii) NUMBER OF SEQUENCES: 29 

(iv) CORRESPONDENCE ADDRESS: 

(A) ^ADDRESSEE: LONG , ALDRIDGE & NORMAN LLP 

(B) STREET: Suite 600, 701 Pennsylvania Avenue, N.W. 

( C) CITY : Washington 

(D) STATUE: District of Columbia 

(E) COUNTY: U.S.A. 

(F) ZIP: \ooo4 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 
CAGCGTGCGC CATCCTTCCC 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: NO 

J(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 
[TTTTCCTCT GGGAAGGATG GCGCACGCTG GGAGA 

(2) INFORMATION FOR SEQ ID NO : 3 : 

i (i) SEQUENCE CHARACTERISTICS: 
w l (A) LENGTH: 2 0 base pairs 

{j (B) TYPE: nucleic acid 

"\ (C) STRANDEDNESS: single 

Uf^ (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

( iv) ANTI-SENSE : YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:: 
GATGCACCTA CCCAGCCTCC 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
ACGGGGTACG GAGGCTGGGT AGGTGCATCT GGT 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

1(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
AAAGGCAT CCTGCAGTTG 



(2) INFORMATION FOR SEQ ID NO: 6: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

MOLECULE TYPE: DNA (genomic) 
ANTI-SENSE: NO 

SEQUENCE DESCRIPTION: SEQ ID NO: 
CCCCCAACTG CAGGATGCCT TTGTGGAACT GTACGG 



(i) 




(ii) 



(iv) 
(xi) 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GGGAAGGATG GCGCACGCTG 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI-SENSE: YES 
m (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 
CfCGTGCGAC CCTCTTG 



(2) INFORMATION FOR SEQ ID NO: 9: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

MOLECULE TYPE: DNA (genomic) 
ANTI-SENSE: YES 

SEQUENCE DESCRIPTION: SEQ ID NO: 9 
TACCGCGTGC GACCCTC 




(ii) 
(iv) 

(xi) 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TCCTACCGCG TGCGACC 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

1(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
TTCCTACC GCGTGCG 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
\ (A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 




(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
GACCCTTCCT ACCGCGT 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
GGAGACCCTT CCTACCG 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
d^GGCGGCAG CGCGG 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 base pairs 
I (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 11 
CGGCGGGGCG ACGGA 




• 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: .16 base pairs 

(B) TYPE: nucleic acid 

( C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CGGGAGCGCG GCGGGC 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TCTCCCAGCG TGCGCCAT 



(Z) INFORMATION FOR SEQ ID NO: 18: 

' (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
\ (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

18 

TGCACTCACG CTCGGCCT 




(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5086 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GCGCCCGCCC CTCCGCGCCG CCTGCCCGCC CGCCCGCCGC GCTCCCGCCC GCCGCTCTCC 60 

GTGGCCCCGC CGCGCTGCCG CCGCCGCCGC TGCCAGCGAA GGTGCCGGGG CTCCGGGCCC 12 0 

TCCCTGCCGG CGGCCGTCAG CGCTCGGAGC GAACTGCGCG ACGGGAGGTC CGGGAGGCGA 18 0 

CCGTAGTCGC GCCGCCGCGC AGGACCAGGA GGAGGAGAAA GGGTGCGCAG CCCGGAGGCG 240 

GGGTGCGCCG GTGGGGTGCA GCGGAAGAGG GGGTCCAGGG GGGAGAACTT CGTAGCAGTC 3 00 

ATCCTTTTTA GGAAAAGAGG GAAAAAATAA AACCCTCCCC CACCACCTCC TTCTCCCCAC 3 60 

CCCTCGCCGC ACCACACACA GCGCGGGCTT CTAGCGCTCG GCACCGGCGG GCCAGGCGCG 420 

TCCTGCCTTC ATTTATCCAG CAGCTTTTCG GAAAATGCAT TTGCTGTTCG GAGTTTAATC 480 

AGAAGACGAT TCCTGCCTCC GTCCCCGGCT CCTTCATCGT CCCATCTCCC CTGTCTCTCT 54 0 

CCTGGGGAGG CGTGAAGCGG TCCCGTGGAT AGAGATTCAT GCCTGTGTCC GCGCGTGTGT 600 

GCGCGCGTAT AAATTGCCGA GAAGGGGAAA ACATCACAGG ACTTCTGCGA ATACCGGACT 660 

GAAAATTGTA ATTCATCTGC CGCCGCCGCT GCCAAAAAAA AACTCGAGCT CTTGAGATCT 72 0 

CCGGTTGGGA TTCCTGCGGA TTGACATTTC TGTGAAGCAG AAGTCTGGGA ATCGATCTGG 78 0 

AAATCCTCCT AATTTTTACT CCCTCTCCCC CCGACTCCTG ATTCATTGGG AAGTTTCAAA 84 0 

TCAGCTATAA CTGGAGAGTG CTGAAGATTG ATGGGATCGT TGCCTTATGC ATTTGTTTTG 9 00 

GTTTTACAAA AAGGAAACTT GACAGAGGAT CATGCTGTAC TTAAAAAATA CAAGTAAGTC 9 60 

TCGCACAGGA AATTGGTTTA ATGTAACTTT CAATGGAAAC CTTTGAGATT TTTTACTTAA 102 0 

AGTGCATTCG AGTAAATTTA ATTTCCAGGC AGCTTAATAC ATTGTTTTTA GCCGTGTTAC 1080 

TTGTAGTGTG TATGCCCTGC TTTCACTCAG TGTGTACAGG GAAACGCACC TGATTTTTTA 114 0 

CTTATTAGTT TGTTTTTTCT TTAACCTTTC AG CAT C AC AG AGGAAGTAGA CTGATATTAA 12 00 

| CAATACTTAC TAATAATAAC GTGCCTCATG AAATAAAGAT CCGAAAGGAA TTGGAATAAA 12 60 

jlAATTTCCTGC GTCTCATGCC AAGAGGGAAA CACCAGAATC AAGTGTTCCG CGTGATTGAA 13 2 0 

^GACACCCCCT CGTCCAAGAA TGCAAAGCAC AT C C AAT AAA ATAGCTGGAT TATAACTCCT 13 8 0 

CTTCTTTCTC TGGGGGCCGT GGGGTGGGAG CTGGGGCGAG AGGTGCCGTT GGCCCCCGTT 14 4 0 

GCTTTTCCTC TGGGAAGGAT GGCGCACGCT GGGAGAACGG GGTACGACAA CCGGGAGATA 1500 

GTGATGAAGT ACATCCATTA TAAGCTGTCG CAGAGGGGCT ACGAGTGGGA TGCGGGAGAT 15 60 

GTGGGCGCCG CGCCCCCGGG GGCCGCCCCC GCACCGGGCA TCTTCTCCTC CCAGCCCGGG 1620 

CACACGCCCC ATCCAGCCGC ATCCCGCGAC CCGGTCGCCA GGACCTCGCC GCTGCAGACC 168 0 
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CCGGCTGCCC CCGGCGCCGC CGCGGGGCCT GCGCTCAGCC CGGTGCCACC TGTGGTCCAC 174 0 

CTGGCCCTCC GCCAAGCCGG CGACGACTTC TCCCGCCGCT ACCGCGGCGA CTTCGCCGAG 1800 

ATGTCCAGCC AGCTGCACCT GACGCCCTTC ACCGCGCGGG GACGCTTTGC CACGGTGGTG 18 60 

GAGGAGCTCT TCAGGGACGG GGTGAACTGG GGGAGGATTG TGGCCTTCTT TGAGTTCGGT 1920 

GGGGTCATGT GTGTGGAGAG CGTCAACCGG GAGATGTCGC CCCTGGTGGA CAACATCGCC 1980 

CTGTGGATGA CTGAGTACCT GAACCGGCAC CTGCACACCT GGATCCAGGA TAACGGAGGC 2 04 0 

TGGGATGCCT TTGTGGAACT GTACGGCCCC AGCATGCGGC CTCTGTTTGA TTTCTCCTGG 2100 

CTGTCTCTGA AGACTCTGCT CAGTTTGGCC CTGGTGGGAG CTTGCATCAC CCTGGGTGCC 2160 

TATCTGAGCC ACAAGTGAAG TCAACATGCC TGCCCCAAAC AAATATGCAA AAGGTTCACT 22 20 

AAAGCAGTAG AAATAATATG CATTGTCAGT GATGTACCAT GAAACAAAGC TGCAGGCTGT 22 80 

TTAAGAAAAA ATAACACACA TATAAACATC ACACACACAG ACAGACACAC ACACACACAA 2340 

CAATTAACAG TCTTCAGGCA AAACGTCGAA TCAGCTATTT ACTGCCAAAG GGAAATATCA 24 00 

TTTATTTTTT ACATTATTAA GAAAAAAGAT TTATTTATTT AAGACAGTCC CATCAAAACT 2 4 60 

CCGTCTTTGG AAATCCGACC ACTAATTGCC AAACACCGCT TCGTGTGGCT CCACCTGGAT 2 52 0 

GTTCTGTGCC TGTAAACATA GATTCGCTTT CCATGTTGTT GGCCGGATCA CCATCTGAAG 2 580 

A'GCAGACGGA TGGAAAAAGG ACCTGATCAT TGGGGAAGCT GGCTTTCTGG CTGCTGGAGG 2640 
CTGGGGAGAA GGTGTTCATT CACTTGCATT TCTTTGCCCT GGGGGCGTGA TATTAACAGA 
GGGAGGGTTC CCGTGGGGGG AAGTCCATGC CTCCCTGGCC TGAAGAAGAG ACTCTTTGCA 

TATGACTCAC ATGATGCATA CCTGGTGGGA GGAAAAGAGT TGGGAACTTC AGATGGACCT 2820 
AGTACCCACT GAGATTTCCA CGCCGAAGGA CAGCGATGGG AAAAATGCCC TTAAATCATA 
GGAAAGTATT TTTTTAAGCT ACCAATTGTG CCGAGAAAAG CATTTTAGCA ATTTATACAA 
TATCATCCAG TACCTTAAAC CCTGATTGTG TAT ATT CAT A TATTTTGGAT ACGCACCCCC 
CAACTCCCAA TACTGGCTCT GTCTGAGTAA GAAACAGAAT CCTCTGGAAC TTGAGGAAGT 

^GAACATTTCG GTGACTTCCG ATCAGGAAGG CTAGAGTTAC CCAGAGCATC AGGCCGCCAC 312 0 
AAGTGCCTGC TTTTAGGAGA CCGAAGTCCG CAGAACCTAC CTGTGTCCCA GCTTGGAGGC 
CTGGTCCTGG AACTGAGCCG GGCCCTCACT GGCCTCCTCC AGGGATGATC AACAGGGTAG 
TGTGGTCTCC GAATGTCTGG AAGCTGATGG ATGGAGCTCA GAATTCCACT GTCAAGAAAG 
AGCAGTAGAG GGGTGTGGCT GGGCCTGTCA CCCTGGGGCC CTCCAGGTAG GCCCGTTTTC 
ACGTGGAGCA TAGGAGCCAC GACCCTTCTT AAGACATGTA TCACTGTAGA GGGAAGGAAC 
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AGAGGCCCTG 


GGCCTTCCTA 


TCAGAAGGAC 


ATGGTGAAGG 


CTGGGAACGT 


GAGGAGAGGC 


34 8 0 


AATGGCCACG 


GCCCATTTTG 


GCTGTAGCAC 


ATGGCACGTT 


GGCTGTGTGG 


CCTTGGCCAC 


3540 


CTGTGAGTTT 


AAAGCAAGGC 


TTTAAATGAC 


TTTGGAGAGG 


GTCACAAATC 


CTAAAAGAAG 


3 600 


CATTGAAGTG 


AGGTGTCATG 


GATTAATTGA 


CCCCTGTCTA 


TGGAATTACA 


TGTAAAACAT 


~* /i r\ 

3 660 


TATCTTGTCA 


CTGTAGTTTG 


GTTTTATTTG 


AAAACCTGAC 


AAAAAAAAAG 


TTCCAGGTGT 


"> *~t *S f\ 

37 2 0 


GGAATATGGG 


GGTTATCTGT 


ACATCCTGGG 


GCATTAAAAA 


AAAATCAATG 


GTGGGGAACT 


o *t o r\ 
3780 


ATAAAGAAGT 


AACAAAAGAA 


GTGACATCTT 


CAGCAAATAA 


ACTAGGAAAT 


TTTTTTTTCT 


3 8 4 U 


TCCAGTTTAG 


AATCAGCCTT 


GAAACATTGA 


TGGAATAACT 


CTGTGGCATT 


tv mmst yi tv mm ta m 

ATTGCATTAT 


o r\ n 
390U 


ATACCATTTA 


TCTGTATTAA 


CTTTGGAATG 


TACTCTGTTC 


AATGTTTAAT 


GCTGTGGTTG 


39oU 


ATATTTCGAA 


AGCTGCTTTA 


AAAAAATACA 


TGCATCTCAG 


CGTTTTTTTG 


TTTTTAATTG 


4uz U 


TATTTAGTTA 


TGGCCTATAC 


ACTATTTGTG 


AGCAAAGGTG 


ATCGTTTTCT 


y»mmm,^» ta tv mm 

GTTTGAGATT 


4 080 


TTTATCTCTT 


GATTCTTCAA 


AAGCATTCTG 


AGAAGGTGAG 


ATAAGCCCTG 


AGTCTCAGCT 


4 14 0 


ACCTAAGAAA 


AACCTGGATG 


TCACTGGCCA 


CTGAGGAGCT 


TTGTTTCAAC 


ta ta y» in tk rn m 

CAAGTCATGT 


a o r\ 

42 00 


GCATTTCCAC 


GTCAACAGAA 


TTGTTTATTG 


TGACAGTTAT 


ATCTGTTGTC 


CCTTTGACCT 


/i *5 /r n 
4 2 oO 


TGTTT CTTG A 


AGGTTTCCTC 


GTCCCTGGGC 


AATTCCGCAT 


TTAATTCATG 


y-t m tk mm^i tv /** 7\ 

GTATTCAGGA 


/I o o f\ 

4^20 


T^ACATGCAT 


GTTTGGTTAA 


ACCCATGAGA 


TTCATTCAGT 


TAAAAATCCA 


GATGGCGAA 1 


yi O Q ft 


GACCAGCAGA 


TTCAAATCTA 


TGGTGGTTTG 


ACCTTTAGAG 


AGTTGCTTTA 


CGTGGCC TG I 


/I A A C\ 


TTCAACACAG 


ACCCACCCAG 


AGCCCTCCTG 


CCCTCCTTCC 


GCGGGGGCTT 


TCTCATGGCT 




GTCCTTCAGG 


GTCTTCCTGA 


AATGCAGTGG 


TCGTTACGCT 


CCACCAAGAA 


AGCAGGAAAt 


a r ^ n 


CTGTGGTATG 


AAGCCAGACC 


TCCCCGGCGG 


GCCTCAGGGA 


ACAGAATGAT 


CAGACC 1 I 1 G 


yfl /T o n 


AATGATTCTA 


ATTTTTAAGC 


AAAATATTAT 


TTTATGAAAG 


GTTTACATTG 


m/ 1 ^ tv tv. tv m 7\ rp 

TC AAAGTG A 1 


ft O tt U 


GAATATGGAA 


TATCCAATCC 


TGTGCTGCTA 


TCCTGCCAAA 


ATCATTTTAA 


TGGAGTCAG 1 


4 / ft u 


\ 

j TTGCAGTATG 


CTCCACGTGG 


TAAGATCCTC 


CAAGCTGCTT 


TAGAAGTAAC 


A A m/™* A A /"* A A /~* 

AATGAAGAAt 




Wtfvf ' 

'W'GTGGACGTTT 


TTAATATAAA 


GCCTGTTTTG 


TCTTTTGTTG 


TTGTTCAAAC 


GGGA1 1 LALA 


a o ^ n 

*i O O VJ 


GAGTATTTGA 


AAAATGTATA 


TATATTAAGA 


GGTCACGGGG 


GCTAATTGCT 


AGCTGGC I CjC 




CTTTTGCTGT 


GGGGTTTTGT 


TACCTGGTTT 


TAATAACAGT 


AAATGTGCCC 


AGCC ICii CjO 




CCCCAGAACT 


GTACAGTATT 


GTGGCTGCAC 


TTGCTCTAAG 


AGTAGTTGAT 


CjX ibLAl 1x1 




CCTTATTGTT 


AAAAACATGT 


TAGAAGCAAT 


GAATGTATAT 


AAAAGC 




5086 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..717 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATG GCG CAC GCT GGG AGA ACG GGG TAG GAC AAC CGG GAG ATA GTG ATG 
Met Ala His Ala Gly Arg Thr Gly Tyr Asp Asn Arg Glu He Val Met 
1 5 10 15 

AAG TAC ATC CAT TAT AAG CTG TCG CAG AGG GGC TAC GAG TGG GAT GCG 
£y"s Tyr lie His Tyr Lys Leu Ser Gin Arg Gly Tyr Glu Trp Asp Ala 
20 25 

GGA GAT GTG GGC GCC GCG CCC CCG GGG GCC GCC CCC GCA CCG GGC ATC 
tly Asp Val Gly Ala Ala Pro Pro Gly Ala Ala Pro Ala Pro Gly He 
* - - 40 45 



35 



TTC TCC TCC CAG CCC GGG CAC ACG CCC CAT CCA GCC GCA TCC CGC GAC 
£he Ser Ser Gin Pro Gly His Thr Pro His Pro Ala Ala Ser Arg Asp 
J 50 55 60 

CCG GTC GCC AGG ACC TCG CCG CTG CAG ACC CCG GCT GCC CCC GGC GCC 
Pro Val Ala Arg Thr Ser Pro Leu Gin Thr Pro Ala Ala Pro Gly Ala 
65 70 75 »u 

GCC GCG GGG CCT GCG CTC AGC CCG GTG CCA CCT GTG GTC CAC CTG GCC 
A^a Ala Gly Pro Ala Leu Ser Pro Val Pro Pro Val Val His Leu Ala 

85 90 95 

CTC CGC CAA GCC GGC GAC GAC TTC TCC CGC CGC TAC CGC GGC GAC TTC 
SS Arg G^n Ala Gly Asp Asp Phe Ser Arg Arg Tyr Arg Gly Asp Phe 
\ . 100 1° 5 J " LU 

£cC GAG ATG TCC AGC CAG CTG CAC CTG ACG CCC TTC ACC GCG CGG GGA 
tla Glu Met Ser Ser Gin Leu His Leu Thr Pro Phe Thr Ala Arg Gly 
115 120 125 

CGC TTT GCC ACG GTG GTG GAG GAG CTC TTC AGG GAC GGG GTG AAC TGG 
Arg Phe Ala Thr Val Val Glu Glu Leu Phe Arg Asp Gly Val Asn Trp 
130 135 I 40 

GGG AGG ATT GTG GCC TTC TTT GAG TTC GGT GGG GTC ATG TGT GTG GAG 
Gly Arg lie Val Ala Phe Phe Glu Phe Gly Gly Val Met Cys Val Glu 

145 I 50 1 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 
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AGC GTC AAC CGG GAG ATG TCG CCC CTG GTG GAC AAC ATC GCC CTG TGG 528 
Ser Val Asn Arg Glu Met Ser Pro Leu Val Asp Asn lie Ala Leu Trp 

165 170 175 

ATG ACT GAG TAC CTG AAC CGG CAC CTG CAC ACC TGG ATC CAG GAT AAC 57 6 

Met Thr Glu Tyr Leu Asn Arg His Leu His Thr Trp lie Gin Asp Asn 
180 185 190 

GGA GGC TGG GAT GCC TTT GTG GAA CTG TAC GGC CCC AGC ATG CGG CCT 624 
Gly Gly Trp Asp Ala Phe Val Glu Leu Tyr Gly Pro Ser Met Arg Pro 
195 200 205 

CTG TTT GAT TTC TCC TGG CTG TCT CTG AAG ACT CTG CTC AGT TTG GCC 67 2 

Leu Phe Asp Phe Ser Trp Leu Ser Leu Lys Thr Leu Leu Ser Leu Ala 
210 215 220 

CTG GTG GGA GCT TGC ATC ACC CTG GGT GCC TAT CTG AGC CAC AAG 717 
Leu Val Gly Ala Cys He Thr Leu Gly Ala Tyr Leu Ser His Lys 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

C (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Met Ala His Ala Gly Arg Thr Gly Tyr Asp Asn Arg Glu He Val Met 

10 15 



1 5 



Lys Tyr He His Tyr Lys Leu Ser Gin Arg Gly Tyr Glu Trp Asp Ala 
20 25 30 

Gly Asp Val Gly Ala Ala Pro Pro Gly Ala Ala Pro Ala Pro Gly He 
35 40 45 

Phe Ser Ser Gin Pro Gly His Thr Pro His Pro Ala Ala Ser Arg Asp 
50 55 60 

i Pro Val Ala Arg Thr Ser Pro Leu Gin Thr Pro Ala Ala Pro Gly Ala 
\ 65 70 75 80 

a Ala Gly Pro Ala Leu Ser Pro Val Pro Pro Val Val His Leu Ala 

85 90 95 

Leu Arg Gin Ala Gly Asp Asp Phe Ser Arg Arg Tyr Arg Gly Asp Phe 
• 100 105 HO 

Ala Glu Met Ser Ser Gin Leu His Leu Thr Pro Phe Thr Ala Arg Gly 

120 125 




115 



Arg Phe Ala Thr Val Val Glu Glu Leu Phe Arg Asp Gly Val Asn Trp 
130 135 140 
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Gly Arg lie Val 
145 

Ser Val Asn Arg 



Met Thr Glu Tyr 
180 



Gly Gly Trp Asp 
195 

Leu Phe Asp Phe 
210 

Leu Val Gly Ala 
225 



Ala Phe Phe Glu 
150 

Glu Met Ser Pro 
165 

Leu Asn Arg His 



Ala Phe Val Glu 
200 



Ser Trp Leu Ser 
215 

Cys lie Thr Leu 
230 



Phe Gly Gly Val 
155 

Leu Val Asp Asn 
170 

Leu His Thr Trp 
185 

Leu Tyr Gly Pro 



Leu Lys Thr Leu 
220 



Gly Ala Tyr Leu 
235 




Met Cys Val Glu 
160 



lie Ala Leu Trp 
175 

lie Gin Asp Asn 
190 

Ser Met Arg Pro 
205 

Leu Ser Leu Ala 



Ser His Lys 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 615 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 
* (A) NAME/KEY: CDS 

(B) LOCATION: 1. . 615 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ATG GCG CAC GCT GGG AGA ACG GGG TAC GAC AAC CGG GAG ATA GTG ATG 
Met Ala His Ala Gly Arg Thr Gly Tyr Asp Asn Arg Glu He Val Met 
1 5 10 15 

AAG TAC ATC CAT TAT AAG CTG TCG CAG AGG GGC TAC GAG TGG GAT GCG 
Lys Tyr He His Tyr Lys Leu Ser Gin Arg Gly Tyr Glu Trp Asp Ala 
20 25 30 

GGA GAT GTG GGC GCC GCG CCC CCG GGG GCC GCC CCC GCA CCG GGC ATC 
\ Gly Asp Val Gly Ala Ala Pro Pro Gly Ala Ala Pro Ala Pro Gly He 
A\ J * 35 40 45 

t^TTC TCC TCC CAG CCC GGG CAC ACG CCC CAT CCA GCC GCA TCC CGC GAC 
Phe Ser Ser Gin Pro Gly His Thr Pro His Pro Ala Ala Ser Arg Asp 
50 .55 60 

CCG GTC GCC AGG ACC TCG CCG CTG CAG ACC CCG GCT GCC CCC GGC GCC 
Pro Val Ala Arg Thr Ser Pro Leu Gin Thr Pro Ala Ala Pro Gly Ala 
65 ' 70 75 



48 



96 



144 



192 



240 
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GCC GCG GGG CCT GCG CTC AGC CCG GTG CCA CCT GTG GTC CAC CTG GCC 288 
Ala Ala Gly Pro Ala Leu Ser Pro Val Pro Pro Val Val His Leu Ala 

85 90 95 

CTC CGC CAA GCC GGC GAC GAC TTC TCC CGC CGC TAC CGC GGC GAC TTC 336 
Leu Arg Gin Ala Gly Asp Asp Phe Ser Arg Arg Tyr Arg Gly Asp Phe 
100 105 HO 



GCC GAG ATG TCC AGC CAG CTG CAC CTG ACG CCC TTC ACC GCG CGG GGA 3 84 

Ala Glu Met Ser Ser Gin Leu His Leu Thr Pro Phe Thr Ala Arg Gly 
115 120 125 

CGC TTT GCC ACG GTG GTG GAG GAG CTC TTC AGG GAC GGG GTG AAC TGG 4 32 

Ara Phe Ala Thr Val Val Glu Glu Leu Phe Arg Asp Gly Val Asn Trp 
nn 135 140 



GGG AGG ATT GTG GCC TTC TTT GAG TTC GGT GGG GTC ATG TGT GTG GAG 480 
Gly Arg He Val Ala Phe Phe Glu Phe Gly Gly Val Met Cys Val Glu 
145 150 155 160 

AGC GTC AAC CGG GAG ATG TCG CCC CTG GTG GAC AAC ATC GCC CTG TGG 528 
Ser Val Asn Arg Glu Met Ser Pro Leu Val Asp Asn He Ala Leu Trp 

165 170 175 



ATG ACT GAG TAC CTG AAC CGG CAC CTG CAC ACC TGG ATC CAG GAT AAC 
Met Thr Glu Tyr Leu Asn Arg His Leu His Thr Trp He Gin Asp Asn 
180 185 190 

GGA GGC TGG GTA GGT GCA TCT GGT GAT GTG AGT CTG GGC 
Glv Gly Trp Val Gly Ala Ser Gly Asp Val Ser Leu Gly 
-* 200 205 



\ 195 
« 
1 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 05 amino acids 
I (B) TYPE: amino acid 

(y ( D ) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 



Met Ala His Ala Gly Arg Thr Gly Tyr Asp Asn Arg Glu He Val Met 
1 5 10 15 

Lys Tyr He His Tyr Lys Leu Ser Gin Arg Gly Tyr Glu Trp Asp Ala 
20 25 30 

Gly Asp Val Gly Ala Ala Pro Pro Gly Ala Ala Pro Ala Pro Gly He 

35 40 45 

Phe Ser Ser Gin Pro Gly His Thr Pro His Pro Ala Ala Ser Arg Asp 
50 55 60 

Pro Val Ala Arg Thr Ser Pro Leu Gin Thr Pro Ala Ala Pro Gly Ala 
65 70 75 
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576 



615 



Ala Ala Gly Pro 



Leu Arg Gin Ala 
100 



Ala Glu Met Ser 
115 

Arg Phe Ala Thr 
130 

Gly Arg lie Val 
145 

Ser Val Asn Arg 



Met Thr Glu Tyr 
180 



Gly Gly Trp Val 
195 



Ala Leu Ser Pro 
85 

Gly Asp Asp Phe 



Ser Gin Leu His 
120 

Val Val Glu Glu 
135 

Ala Phe Phe Glu 
150 

Glu Met Ser Pro 
165 

Leu Asn Arg His 



Gly Ala Ser Gly 
200 



Val Pro Pro Val 
90 

Ser Arg Arg Tyr 
105 

Leu Thr Pro Phe 



Leu Phe Arg Asp 
140 

Phe Gly Gly Val 
155 

Leu Val Asp Asn 
170 

Leu His Thr Trp 
185 

Asp Val Ser Leu 



Val His Leu Ala 
95 

Arg Gly Asp Phe 
110 

Thr Ala Arg Gly 
125 

Gly Val Asn Trp 



Met Cys Val Glu 
160 

lie Ala Leu Trp 
175 

lie Gin Asp Asn 
190 

Gly 
205 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Other nucleic acid; 
(A) DESCRIPTION: Synthetic DNA 

(iv) ANTI-SENSE: YES 

i (ix) FEATURE: 
A (A) NAME/KEY: Modif iedjoase 

& y [d) OTHER 1 INFORMATION: Last two internucleoside linkages are 

phosphorothioates 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

18 

TCTCCCAGCG TGCGCCAT 



(2) INFORMATION FOR SEQ ID NO: 25: 

' (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



-16- 




(ii) MOLECULE TYPE: Other nucleic acid; 
(A) DESCRIPTION: Synthetic DNA 

(iv) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied_base 

(B) LOCATION: 16.. 17 

(D) OTHER INFORMATION: Last two internucleoside linkages are 
phosphorothioates 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

TGCACTCACG CTCGGCCT 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Other nucleic acid; 
(A) DESCRIPTION: Synthetic DNA 

(iv) ANTI-SENSE: YES 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied_base 

(B) LOCATION: 18 . . 19 

(D) OTHER INFORMATION: Last two internucleoside linkages are 
phosphorothioates 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GCGCGGCGGG CGGGCGGGCA 



(2) INFORMATION FOR SEQ ID NO: 27: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Other nucleic acid; 
(A) DESCRIPTION: Synthetic DNA 

* (iv) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied_base 

(B) LOCATION: 18 . . 19 . 

(D) OTHER INFORMATION: Last two internucleoside linkages are 
phosphorothioates 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GGGCGGAGGC CGGCCGGCGG 



20 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Other nucleic acid; 
(A) DESCRIPTION: Synthetic DNA 

(iv) ANTI-SENSE: YES 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied_base 

(B) LOCATION: 18 . . 19 

(D) OTHER INFORMATION: Last two internucleoside linkages are 
phosphorothioates 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

AGCGGCGGCG GCGGCAGCGC 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

/ ii) MOLECULE TYPE: Other nucleic acid; 
(A) DESCRIPTION: Synthetic DNA 

(iv) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied_base 

(B) LOCATION: 18 . . 19 . . 
(D) OTHER INFORMATION: Last two internucleoside linkages are 

phosphorothioates 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:29: 
GGGCCGGGAA GGGCGCCCGC 
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